WHAT IS CLAIMED IS:

1. A method of predicting functional similarity between two molecules 
comprising: 

deriving a first one-dimensional representation of a first molecule from 
distances between selected atoms of said first molecule;

deriving a second one-dimensional representation of a second molecule 
from distances between selected atoms of said second molecule; and 
comparing said one-dimensional representations.

2. The method of Claim 1, wherein said one-dimensional representations 
comprise linear representations.

3. The method of Claim 2, wherein said deriving comprises calculating a 
linear position for each of a plurality of selected atoms of said first molecule.

4. The method of Claim 3, wherein said linear positions are selected to 
reduce the deviation between relative positions of said plurality of selected atoms along
a line and three dimensional distances between said plurality of atoms in said first
molecule. 
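
By way of illustration only, the position fitting recited in Claims 3 and 4 might be sketched as follows. This is a minimal sketch, not the claimed implementation: it assumes plain gradient descent on the summed squared deviation between linear and three dimensional distances, and the names `pairwise_distances`, `stress`, and `fit_linear_positions`, along with the `steps` and `rate` parameters, are hypothetical.

```python
import math


def pairwise_distances(coords):
    """Three dimensional distance between every pair of selected atoms."""
    n = len(coords)
    return [[math.dist(coords[i], coords[j]) for j in range(n)] for i in range(n)]


def stress(positions, dist):
    """Total squared deviation between linear distances and target distances."""
    n = len(positions)
    return sum(
        (abs(positions[i] - positions[j]) - dist[i][j]) ** 2
        for i in range(n)
        for j in range(i + 1, n)
    )


def fit_linear_positions(dist, steps=2000, rate=0.01):
    """Assign each atom a position on a line so as to reduce the stress.

    Assumption: fixed-step gradient descent, which is adequate for small
    atom sets; the step size would need reducing for sets near 100 atoms.
    """
    n = len(dist)
    # Initial guess: distance from the first atom, nudged to break ties.
    x = [dist[0][i] + 1e-3 * i for i in range(n)]
    for _ in range(steps):
        grad = [0.0] * n
        for i in range(n):
            for j in range(n):
                if i == j:
                    continue
                diff = x[i] - x[j]
                sign = 1.0 if diff >= 0 else -1.0
                grad[i] += 2.0 * (abs(diff) - dist[i][j]) * sign
        x = [xi - rate * g for xi, g in zip(x, grad)]
    return x
```

For atoms that already lie on a line the fitted linear distances reproduce the three dimensional distances; for a bent arrangement some deviation necessarily remains, which is the situation Claim 4 addresses by reducing it.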

5. The method of Claim 3, wherein said deriving comprises calculating a 
linear position for each of a plurality of selected atoms of said second molecule. 

6. The method of Claim 5, wherein said comparing comprises aligning said
one-dimensional representations so as to match linear positions of at least one of said
selected atoms of said first molecule with at least one of said selected atoms of said
second molecule. 

7. The method of Claim 6, wherein said matched atoms have the same 
element type. 

8. The method of Claim 1, wherein said matched atoms have the same
hybridization state.

9. The method of Claim 1, wherein at least some of said distances are 
derived from molecular topology. 

10. The method of Claim 9, wherein at least some of said distances are 
derived from bond counts.

11. The method of Claim 1, wherein at least some of said distances are
derived from three dimensional atomic coordinates. 

12. A computer implemented method of comparing molecules comprising:

representing a first molecule as a first set of selected atoms, wherein each
atom of said first set is associated with an atom type and a scalar value, wherein
the scalar values are derived from distances between said selected atoms;

representing a second molecule as a second set of selected atoms, 
wherein each atom of said second set is associated with an atom type and a 
scalar value, wherein the scalar values are derived from distances between said 
selected atoms; and

comparing said atom types and scalar values. 

13. The method of Claim 12, wherein said scalar value represents a linear 
position. 

14. The method of Claim 13, wherein each atom of said set is associated 
with a second scalar value, said second scalar value representing a width centered about
each of said linear positions.

15. The method of Claim 13, wherein said comparing comprises: 

aligning the linear positions of an atom in the first molecule with an atom 
of the same type in the second molecule such that their lengths completely 
overlap; and

evaluating the amount of overlap between atoms of the first molecule 
and atoms of the same type of the second molecule. 

16. The method of Claim 15, comprising repeating the aligning and
evaluating steps so as to evaluate the overlap at all linearly aligned positions of atom
pairs having the same type.
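
By way of illustration only, the aligning and evaluating steps of Claims 15 and 16 might be sketched as follows. This is a sketch under stated assumptions, not the claimed implementation: each atom is taken to be an `(atom_type, linear_position)` pair, every atom is assumed to occupy a segment of one common `width`, and overlap is scored as the shared length of two such segments. The names `overlap_score` and `best_alignment` are hypothetical.

```python
def overlap_score(mol_a, mol_b, shift, width=1.0):
    """Sum the segment overlap between same-type atoms of two molecules.

    mol_a, mol_b: lists of (atom_type, linear_position) pairs.
    shift: offset added to every position of mol_b before comparison.
    width: assumed common segment length centered on each position.
    """
    total = 0.0
    for type_a, pos_a in mol_a:
        for type_b, pos_b in mol_b:
            if type_a == type_b:
                # Two equal-width segments share (width - separation), if positive.
                total += max(0.0, width - abs(pos_a - (pos_b + shift)))
    return total


def best_alignment(mol_a, mol_b, width=1.0):
    """Try every alignment that superimposes one same-type atom pair.

    Returns (score, shift) for the first alignment with the largest overlap.
    """
    best = (0.0, 0.0)
    for type_a, pos_a in mol_a:
        for type_b, pos_b in mol_b:
            if type_a != type_b:
                continue
            shift = pos_a - pos_b  # makes this pair overlap completely
            score = overlap_score(mol_a, mol_b, shift, width)
            if score > best[0]:
                best = (score, shift)
    return best
```

Restricting the candidate shifts to those that superimpose a same-type atom pair keeps the search finite, at one evaluation per such pair.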

17. The method of Claim 14, wherein said second scalar value is the same 
for all of said selected atoms. 

18. A method of representing a non-linear three dimensional configuration of 
a molecule made up of a plurality of bonded atoms, said method comprising assigning,
to at least some of said atoms, a position along a line so as to define a set of linear
distances between each of said selected atoms, wherein at least some of said linear
distances are not equal to the corresponding three dimensional distances between the
same atoms in said molecule. 

19. The method of Claim 18 comprising reducing a total deviation between 
said set of linear distances and the corresponding three dimensional distances between
the same atoms in said molecule.

20. A method of representing a non-linear three dimensional configuration of
a molecule made up of a plurality of bonded atoms, said method comprising assigning,
to at least some of said atoms, a position along a line so as to define a set of linear
distances between each of said selected atoms, wherein at least some of said linear
distances are not equal to corresponding topologically defined distances between the
same atoms in said molecule. 

21. The method of Claim 20 comprising reducing a total deviation between 
said set of linear distances and the corresponding topological distances between the 
same atoms in said molecule. 

22. A method of selecting for further testing and analysis a subset of
molecules from a library of molecular structures, said method comprising:

storing linear representations of said molecular structures in a database,
said linear representations being derived from three dimensional distances or
topological distances between atoms of said molecular structures;

deriving a linear representation of a molecule having known biochemical
activity from three dimensional distances or topological distances between
atoms of said molecule;

comparing said linear representation of said molecule having known
biochemical activity with said linear representations of said molecular structures
in said database.

23. A method of molecular parameterization comprising:

selecting a set of atoms in said molecule;

deriving a set of scalar values from distances between said selected
atoms;

assigning to each of said selected atoms a parameter set including an
atom type and one of said set of scalar values.

24. The method of Claim 23, wherein said set of scalar values represents a
set of linear positions. 

25. The method of Claim 24, additionally comprising assigning a second 
scalar value to each of said selected atoms, said second scalar value representing a width
centered about each of said linear positions.

26. The method of Claim 25, wherein said second scalar value is the same 
for all of said selected atoms. 

27. The method of Claim 23, wherein said distances comprise three 
dimensional distances. 

28. The method of Claim 23, wherein said distances comprise topological
distances.

29. A method of drug discovery comprising comparing molecules that have 
been parameterized according to the method of Claim 23. 

30. A computer readable storage medium having stored thereon structural 
representations of molecules for retrieval by a computer implemented molecular
screening program, wherein at least one of said structural representations comprises a
list of selected atoms in said molecule, wherein each of said selected atoms is associated 
with an atom type and a scalar value. 

31. The method of Claim 30, wherein said scalar value is derived from 
distances between said selected atoms in said molecule.

32. The method of Claim 31, wherein said distances comprise three 
dimensional distances. 

33. The method of Claim 31, wherein said distances comprise topological 
distances. 

34. The computer readable storage medium of Claim 30, wherein said scalar
value comprises a position in a linear representation of said molecule.

35. A computer implemented method of predicting one or more chemical
properties of a molecule comprising: 

retrieving a linear representation of a first molecule having a known 
chemical property, said linear representation comprising a set of atoms and
associated linear positions;

retrieving a linear representation of a second molecule, said linear
representation comprising a set of atoms and associated linear positions; and 
comparing said linear representations. 